Search CORE

35 research outputs found

La polysémie régulière dans WordNet

Author: Barque Lucie
Chaumartin François-Régis
Publication venue: HAL CCSD
Publication date: 09/06/2008
Field of study

International audienceThis paper presents an analysis and modeling of polysemy in the WordNet English lexical database. It exploits the concepts hierarchy (constituted by synsets), and the gloss defining each of these concepts. The result consists of rules set which enabled us to identify in a largely automated way, with a precision close to 91%, more than 2100 synsets pairs, connected by a regular polysemy relation. Our method also allows a partial word sense disambiguation of the definition associated with these synsets.Cette étude propose une analyse et une modélisation des relations de polysémie dans le lexique électronique anglais WordNet. Elle exploite pour cela la hiérarchie des concepts (représentés par des synsets), et la définition associée à chacun de ces concepts. Le résultat est constitué d'un ensemble de règles qui nous ont permis d'identifier d'une façon largement automatisée, avec une précision voisine de 91%, plus de 2100 paires de synsets liés par une relation de polysémie régulière. Notre méthode permet aussi une désambiguïsation lexicale partielle des mots de la définition associée à ces synsets

INRIA a CCSD electronic archive server

Hal-Diderot

From the Definitions of the "Trésor de la Langue Française" To a Semantic Database of the French Language

Author: Barque Lucie
Nasr Alexis
Polguère Alain
Publication venue: HAL CCSD
Publication date: 06/07/2010
Field of study

International audienceThe Definiens project aims at building a database of French lexical semantics that is formal and structured enough to allow for a fine-grained semantic access to the French lexicon—for such tasks as automatic extraction and computation. To achieve this in a relatively short time, we process the definitions of the Trésor de la Langue Française informatisé (TLFi), enriching them with an XML tagging that makes explicit their internal organization (roughly, genus and differentiae) and enhancing the components with semantic labels that explicit their role in the definition. There is, to our knowledge, no existing broad coverage database for the French lexicon that offers to researchers and NLP developers a structured decomposition of the meaning of lexical units. Definiens is an ongoing research that will hopefully fill this gap in the near future

HAL AMU

Hal-Diderot

Improvement of VerbNet-like resources by frame typing

Author: Barque Lucie
Constant Mathieu
Danlos Laurence
Publication venue: HAL CCSD
Publication date: 11/12/2016
Field of study

International audienceVerbenet is a French lexicon developed by " translation " of its English counterpart — VerbNet (Kipper-Schuler, 2005) — and treatment of the specificities of French syntax (Pradet et al., 2014; Danlos et al., 2016). One difficulty encountered in its development springs from the fact that the list of (potentially numerous) frames has no internal organization. This paper proposes a type system for frames that shows whether two frames are variants of a given alternation. Frame typing facilitates coherence checking of the resource in a " virtuous circle ". We present the principles underlying a program we developed and used to automatically type frames in Verbenet. We also show that our system is portable to other languages

INRIA a CCSD electronic archive server

Building a lexicon of French deverbal nouns from a semantically annotated corpus

Author: Balvet Antonio
Barque Lucie
Marin Rafael
Publication venue: HAL CCSD
Publication date: 01/01/2010
Field of study

International audienceThe ongoing project Nomage aims at describing the aspectual properties of deverbal nouns in an empirical way. It is centered on the development of two resources: a semantically annotated corpus of deverbal nouns, and an electronic lexicon. They are both presented in this paper, and emphasize how the semantic annotations of the corpus allow the lexicographic description of deverbal nouns to be validated, in particular their polysemy

CiteSeerX

INRIA a CCSD electronic archive server

Hal-Diderot

Dictionary-Ontology Cross-Enrichment Using TLFi and WOLF to enrich one another

Author: Barque Lucie
Eckard Emmanuel
Nasr Alexis
Sagot Benoît
Publication venue: Curran Associates, Inc.
Publication date: 15/12/2012
Field of study

International audienceIt has been known since Ide and Veronis that it is impossible to automatically extract an ontology structure from a dictionary, because that information is simply not present. We at- tempt to extract structure elements from a dictionary using clues taken from a formal ontology, and use these elements to match dictionary definitions to ontology synsets; this allows us to enrich the ontology with dictionary definitions, assign ontological structure to the dictionary, and disambiguate elements of definitions and synsets

HAL AMU

INRIA a CCSD electronic archive server

HAL-Paris 13

Hal-Diderot

Un Verbenet du français

Author: Barque Lucie
Constant Mathieu
Danlos Laurence
Nakamura Takuya
Pradet Quentin
Publication venue: ATALA (Association pour le Traitement Automatique des Langues)
Publication date: 07/09/2016
Field of study

International audienceVerbNet is a lexical resource for English verbs that has proven useful for NLP thanks to its high lexical and syntactic coverage and its systematic coding of thematic roles. Such a resource doesn’t exist for French. This has motivated us to develop a Verbenet for French. We present how we have developed Verbenet from VerbNet while using as far as possible the available lexical resources for French, and how the various French alternations are coded, focusing on differences with English (existence of pronominal forms, for example). This paper should allow an NLP researcher to use Verbenet in a simple and efficient way for a task such as semantic role labeling.VerbNet est une ressource lexicale pour les verbes anglais qui est largement utilisée en TAL du fait de sa bonne couverture lexicale et syntaxique et de son encodage systématique des rôles thématiques. Aucune ressource équivalente n'existe pour le français, ce qui nous a motivés pour développer un Verb@net du français. Nous présentons comment nous avons développé Verb@net à partir de VerbNet tout en utilisant au maximum les ressources lexicales existantes du français, et comment sont encodées les différentes alternances du français en mettant l'accent sur les différences avec l'anglais (l'existence de formes pronominales, par exemple). Cet article devrait permettre à un chercheur en TAL une utilisation simple et efficace de Verb@net pour une tâche comme l'annotation en rôles sémantiques

Hal-Diderot

Regular Polysemy in WordNet

Author: Barque Lucie
Chaumartin François-Régis
Publication venue: GSCL (Gesellschaft für Sprachtechnologie und Computerlinguistik)
Publication date: 01/01/2009
Field of study

International audienceThe importance of describing regular polysemy in a lexicon has often been outlined, especially in the field of natural language processing (for a good overview of this issue, see (Ravin and Leacock, 2000)). Unfortunately, no existing broad-coverage semantic lexicon has been built following this relatively recent advice. And since producing a broad coverage semantic lexicon is a very time-consuming task, one has to put this idea into practice on existing lexicons. WordNet is an appropriate lexical semantic resource for running this experiment as it is machine readable and has a wide coverage (Fellbaum, 1998). In this paper, we introduce a method to create regular polysemy patterns from WordNet data and to automatically detect their occurrences in the lexicon

INRIA a CCSD electronic archive server

Hal-Diderot

Improvement of VerbNet-like resources by frame typing

Author: Barque Lucie
Constant Mathieu
Danlos Laurence
Publication venue: HAL CCSD
Publication date: 11/12/2016
Field of study

INRIA a CCSD electronic archive server

De la simplicité en morphologie

Author: Barque Lucie
Haas Pauline
Huyghe Richard
Tribout Delphine
Publication venue: 'EDP Sciences'
Publication date: 01/01/2014
Field of study

International audienceLes unités lexicales simples ne font que rarement l’objet d’études en morphologie dérivationnelle, bien qu’elles soient le point de départ des mécanismes qui la sous-tendent. Nous nous proposons dans cet article d’élaborer et d’analyser un important corpus de noms simples du français. L’objectif est de vérifier l’hypothèse de Croft (1991) selon laquelle les noms simples dénotent prototypiquement des objets, contrairement aux noms construits, qui renvoient principalement à des actions ou à des propriétés, selon qu’ils sont dérivés de verbes ou d’adjectifs. Nous constituons dans un premier temps un corpus de noms simples, ce qui présuppose de définir cette notion, plus problématique qu’elle ne le paraît d’emblée. Nous proposons dans un second temps une annotation sémantique des quelque 3500 noms simples retenus, en noms d’objet, d’action ou de propriété, et nous détaillons la série de tests linguistiques employés pour établir cette classification. Il ressort de notre étude qu’environ un quart des noms simples ne dénotent pas (ou pas uniquement) des objets. Certains d’entre eux relèvent de classes nominales plus spécifiques, non réductibles aux trois catégories initialement considérées

EDP Sciences OAI-PMH repository (1.2.0)

INRIA a CCSD electronic archive server

Developing a French FrameNet: Methodology and First results

Author: Amsili Pascal
Barque Lucie
Benamara Farah
Candito Marie
De Chalendar Gaël
Djemaa Marianne
Haas Pauline
Huyghe Richard
Mathieu Yvette Yannick
Muller Philippe
Sagot Benoît
Vieu Laure
Publication venue: HAL CCSD
Publication date: 01/05/2014
Field of study

International audienceThe Asfalda project aims to develop a French corpus with frame-based semantic annotations and automatic tools for shallow semantic analysis. We present the ﬁrst part of the project: focusing on a set of notional domains, we delimited a subset of English frames, adapted them to French data when necessary, and developed the corresponding French lexicon. We believe that working domain by domain helped us to enforce the coherence of the resulting resource, and also has the advantage that, though the number of frames is limited (around a hundred), we obtain full coverage within a given domain

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

Open Archive Toulouse Archive Ouverte

HAL-CEA

Hal-Diderot